Using Deep Learning to Annotate Karaoke Songs

Authors

  • Juliette Faille
  • Yuyi Wang
Abstract

Karaoke is a game in which players sing over pre-recorded instrumental backing tracks. To help the singer, the lyrics are usually displayed on a video screen. Synchronizing the lyrics display with the song recording is often done manually, which is tedious and time-consuming. Automating the annotation of karaoke songs can save time and effort. In this thesis, we represent songs as spectrograms in order to detect singing times. This timing information can later be used to align the lyrics display with a soundtrack. Convolutional neural networks are trained to detect, at any moment in a song, whether the artist is singing or not.
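The abstract does not say how frame-level predictions become "singing times", so the following is only a minimal sketch under stated assumptions: a trained network is assumed to output one singing probability per spectrogram frame, and the function name `singing_intervals`, the `threshold` and `min_duration` parameters, and the 0.5-second hop are all hypothetical choices made for illustration, not the authors' method.

```python
from typing import List, Sequence, Tuple


def singing_intervals(
    frame_probs: Sequence[float],
    hop_seconds: float,
    threshold: float = 0.5,
    min_duration: float = 0.0,
) -> List[Tuple[float, float]]:
    """Turn per-frame singing probabilities into (start, end) times in seconds.

    frame_probs  : assumed output of a singing/not-singing classifier,
                   one value per spectrogram frame (hypothetical input).
    hop_seconds  : time between consecutive frames.
    threshold    : a frame counts as "singing" when its probability >= threshold.
    min_duration : intervals shorter than this are discarded.
    """
    intervals: List[Tuple[float, float]] = []
    start = None  # index of the first frame of the current singing run
    for i, p in enumerate(frame_probs):
        is_singing = p >= threshold
        if is_singing and start is None:
            start = i  # a singing run begins
        elif not is_singing and start is not None:
            intervals.append((start * hop_seconds, i * hop_seconds))
            start = None  # the run ended on the previous frame
    if start is not None:
        # The song ends while the artist is still singing.
        intervals.append((start * hop_seconds, len(frame_probs) * hop_seconds))
    return [(s, e) for s, e in intervals if e - s >= min_duration]


if __name__ == "__main__":
    probs = [0.1, 0.2, 0.9, 0.8, 0.7, 0.1, 0.6, 0.9]
    print(singing_intervals(probs, hop_seconds=0.5))
    # → [(1.0, 2.5), (3.0, 4.0)]
```

Intervals of this kind are one plausible form for the timing information that could later be matched against lyric lines; a real system would likely also smooth the probabilities before thresholding.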


Similar resources

Efficient karaoke song recommendation via multiple kernel learning approximation

Online karaoke allows users to practice singing and distribute recordings. Unlike traditional music recommendation, online karaoke needs to consider users’ vocal competence in addition to their tastes. In this paper, we develop a karaoke recommender system that takes vocal competence into account. Along this line, we propose a joint modeling method named MKLA by adopting Bregman divergence as th...


Personalized Karaoke

In this paper, a personalized Karaoke system, P-Karaoke, is proposed. In the P-Karaoke system, personal home videos and photographs, which are automatically selected from users’ multimedia databases according to their content, users’ preferences, or music, are used as the background videos of the Karaoke. The selected video clips, photographs, and lyrics that are obtained from Lyric Service or ma...


On the Effect of Using Games, Songs, and Stories on Young Iranian EFL Learners' Achievement

The objective of the present study was to identify and examine the influence of instructional tools, namely games, songs, and stories, on young Iranian EFL learners’ achievement, utilizing a quantitative design. To conduct the study, 65 Iranian EFL learners, divided into an experimental group and a control group, learning English at Navid English Institute, Shiraz, Iran, participated in the s...


Singer Traits Identification using Deep Neural Network

The author investigates automatic recognition of singers’ gender and age from audio features using a deep neural network (DNN). For each singing voice, the fundamental frequency and Mel-Frequency Cepstrum Coefficients (MFCC) are extracted as features for neural network training. 10,000 singing voices from Smule’s Sing! Karaoke app are used for training and evaluation, and the DNN-based method achieves a...


A Music Retrieval System Based on Query-by-Singing for Karaoke Jukebox

This paper investigates the problem of retrieving Karaoke music by singing. Karaoke music encompasses two audio channels in each track: one is a mix of vocals and background accompaniment, and the other contains the accompaniment only. The accompaniments in the two channels often resemble each other but are not identical. This characteristic is exploited to infer the vocal’s background mu...



Publication date: 2016